A new consequence of Simpson’s paradox: Stable co-operation in one-shot Prisoner’s Dilemma from populations of individualistic learning agents
نویسندگان
چکیده
Normative theories of individual choice in economics typically assume that interacting agents should each act individualistically: i.e., they should maximize their own utility function. Specifically, game theory proposes that interaction should be governed by Nash equilibria. Computationally limited agents (whether artificial, animal or human) may not, however, have the capacity to carry out the sophisticated reasoning to converge directly on Nash equilibria. Nonetheless it is often assumed that Nash equilibria will be obtained, in any case, if agents embody simple learning algorithms like reinforcement learning. If so, then learners should converge on Nash equilibria, after sufficient practice in playing a game---and hence, for example, individualistic agents should end up playing D (defect) in one-shot Prisoners’ Dilemmas (PD). In an experiment and in a multi-agent simulation, we show, however, that this is not always the case---under certain circumstances, reinforcement learners can converge on co-operative behaviour in PD. That is, even though each agent would receive higher pay-off from switching to D, agents obtain more reinforcement, on average, from playing C, and hence C is more strongly reinforced. This effect arises from a well-known statistical paradox, Simpson’s paradox. We speculate that this effect may be relevant to some aspects of real-world human co-operative behaviour.
منابع مشابه
Simpson’s Paradox Can Emerge from the N-Player Prisoner’s Dilemma: Implications for the Evolution of Altruistic Behavior
Simulations of the n-player Prisoner’s Dilemma in multiple populations reveal that Simpson’s paradox can emerge in such game-theoretic situations. The relative proportion of cooperators can decrease in each separate sub-population, while the proportion of cooperators in the total population can nonetheless increase, at least transiently. Factors that determine the longevity of this effect are u...
متن کاملGroup Biases in the Prisoner's Dilemma Game: Looking for Simpson's Paradox Effects on Cooperation
The role of Simpson’s paradox effects on cooperation in one-shot intraand inter-group Prisoner’s Dilemma games is explored. Three experimental conditions are considered in a between-subject design – a group that plays games only within the group (intra-group condition), a group that plays games only with members of the other group (intergroup condition), and a group which plays a combination of...
متن کاملN-Player Prisoner’s Dilemma in Multiple Groups: A Model of Multilevel Selection
Simulations of the n-player Prisoner’s Dilemma (PD) in populations consisting of multiple groups reveal that Simpson’s paradox (1951) can emerge in such gametheoretic situations. In Simpson’s paradox, as manifest here, the relative proportion of cooperators can decrease in each separate group, while the proportion of cooperators in the total population can nonetheless increase, at least transie...
متن کاملA non-cooperative Pareto-efficient solution to a single-shot Prisoner's Dilemma
The Prisoner’s Dilemma is a simple model that captures the essential contradiction between individual rationality and global rationality. Although the one-shot Prisoner’s Dilemma is usually viewed simple, in this paper we will categorize it into five different types. For the type-4 Prisoner’s Dilemma game, we will propose a selfenforcing algorithmic model to help non-cooperative agents obtain P...
متن کاملModeling Cooperation between Nodes in Wireless Networks by APD Game
Cooperation is the foundation of many protocols in wireless networks. Without cooperation, the performance of a network significantly decreases. Hence, all nodes in traditional networks are required to cooperate with each other. In this paper, instead of traditional networks, a network of rational and autonomous nodes is considered, which means that each node itself can decide whe...
متن کامل